Efficient generated libraries for asynchronous derivative computation
Authors
Abstract
The computation of derivatives via automatic differentiation is a valuable technique in many science and engineering applications. While the implementation of automatic differentiation via source transformation yields the highest-efficiency results, the implementation via operator overloading remains a viable alternative for some application contexts, such as the computation of higher-order derivatives or in cases where C++ still proves to be too complicated for the currently available source transformation tools. The Rapsodia code generator creates libraries that overload intrinsics for derivative computation. In this paper, we discuss modifications to Rapsodia to improve the efficiency of the generated code, first via limited loop unrolling and second via multithreaded asynchronous derivative computation. We introduce the approaches and present runtime results.
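To make the operator-overloading idea concrete, here is a minimal sketch in Python. It is not Rapsodia's generated code, which is not shown in this abstract. The class name `Taylor2`, the helper `sin`, and the test function `f` are all hypothetical and chosen only for illustration. The sketch propagates truncated Taylor coefficients up to order 2 through overloaded arithmetic, with the coefficient convolution in the product written out by hand rather than as a loop, loosely in the spirit of the limited loop unrolling the abstract mentions.

```python
import math


class Taylor2:
    """Truncated Taylor polynomial c0 + c1*t + c2*t**2 (hypothetical type)."""

    def __init__(self, c0, c1=0.0, c2=0.0):
        self.c0, self.c1, self.c2 = float(c0), float(c1), float(c2)

    @staticmethod
    def _lift(x):
        # Promote plain numbers to constant polynomials.
        return x if isinstance(x, Taylor2) else Taylor2(x)

    def __add__(self, other):
        o = Taylor2._lift(other)
        return Taylor2(self.c0 + o.c0, self.c1 + o.c1, self.c2 + o.c2)

    __radd__ = __add__

    def __mul__(self, other):
        o = Taylor2._lift(other)
        # Coefficient convolution, written out explicitly for order 2.
        return Taylor2(
            self.c0 * o.c0,
            self.c0 * o.c1 + self.c1 * o.c0,
            self.c0 * o.c2 + self.c1 * o.c1 + self.c2 * o.c0,
        )

    __rmul__ = __mul__


def sin(x):
    """Overloaded intrinsic: sine of a truncated Taylor polynomial."""
    x = Taylor2._lift(x)
    s, c = math.sin(x.c0), math.cos(x.c0)
    # Second-order expansion of sin(x0 + x1*t + x2*t**2) around x0.
    return Taylor2(s, c * x.c1, c * x.c2 - 0.5 * s * x.c1 * x.c1)


def f(x):
    # Hypothetical test function: f(x) = x**3 + sin(x).
    return x * x * x + sin(x)


x = Taylor2(2.0, 1.0)            # value 2.0, seeded with direction 1
y = f(x)
first_derivative = y.c1          # f'(2)
second_derivative = 2.0 * y.c2   # f''(2) = 2! times the t**2 coefficient
```

Because `f` is written against ordinary arithmetic and an overloaded intrinsic, the same source evaluates either plain floats or derivative-carrying objects. That is the property operator-overloading tools rely on.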
Similar resources
A Mathematical Approach to a Low Power FFT Architecture
Architecture and circuit design are the two most effective means of reducing power in CMOS VLSI. Mathematical manipulations have been applied to create a power efficient architecture of an FFT. This architecture has been implemented in asynchronous circuit technology that achieves significant power reduction over other FFT architectures. Multirate signal processing concepts are applied to the F...
Asynchronous Bulk File I/O in Titanium, a High-Performance SPMD Java Dialect
Titanium is a high-performance explicitly parallel SPMD dialect of Java. Currently, the runtime support for file I/O is limited to the classes present in Java, which have proven too inefficient to meet the demands of high-performance scientific applications which perform large amounts of file I/O. A new library is presented which adds support for asynchronous file I/O operations to enable maski...
The analytical solutions for Volterra integro-differential equations within Local fractional operators by Yang-Laplace transform
In this paper, we apply the local fractional Laplace transform method (or Yang-Laplace transform) on Volterra integro-differential equations of the second kind within the local fractional integral operators to obtain the analytical approximate solutions. The iteration procedure is based on local fractional derivative operators. This approach provides us with a convenient way to find a solution ...
D-iteration based asynchronous distributed computation
The aim of this paper is to explain how the D-iteration can be used for an efficient asynchronous distributed computation. We present the main ideas of the method and illustrate them through very simple examples.
Parallel Data Structures for Symbolic Computation
Symbolic applications often require dynamic irregular data structures, such as linked lists, unbalanced trees, and graphs, and they exhibit unpredictable computational patterns that lead to asynchronous communication and load imbalance when parallelized. In this paper we describe several symbolic applications and their parallelizations. The main problem in parallelization of each application wa...
Publication date: 2010